Language Identification With Confidence Limits

Author

  • David Elworthy
Abstract

A statistical classification algorithm and its application to language identification from noisy input are described. The main innovation is to compute confidence limits on the classification, so that the algorithm terminates once enough evidence has accumulated to make a clear decision, thereby avoiding problems with categories that have similar characteristics. A second application, to genre identification, is briefly examined. The results show that some of the problems of other language identification techniques can be avoided, and illustrate a more important point: that a statistical language process can be used to provide feedback about its own success rate.
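The abstract does not spell out the algorithm, so the following is only a minimal sketch of the general idea of stopping once a confidence limit separates the leading category from its nearest rival. Everything in it is an assumption made for illustration rather than the paper's method: the function name `classify_with_confidence`, the toy word-probability tables in `TOY_MODELS`, the probability floor for unseen tokens, and the use of a normal-approximation lower limit (`z` standard errors) on the mean per-token log-likelihood difference.

```python
import math

# Hypothetical toy models: per-language token probabilities (illustrative only).
TOY_MODELS = {
    "en": {"the": 0.3, "and": 0.2, "of": 0.2},
    "fr": {"le": 0.3, "et": 0.2, "de": 0.2},
}


def classify_with_confidence(tokens, models, z=2.0, min_tokens=3, floor=1e-6):
    """Read tokens one at a time and stop as soon as the lower confidence
    limit on the leader's advantage over the runner-up is above zero.

    Returns (best_label, tokens_used, decided). Requires at least two models.
    This is an assumed stopping rule, not the rule from the paper.
    """
    min_tokens = max(min_tokens, 2)  # sample variance needs two observations
    logps = {label: [] for label in models}
    best = None
    used = 0
    for token in tokens:
        used += 1
        # Score the token under every model; unseen tokens get a small floor.
        for label, probs in models.items():
            logps[label].append(math.log(probs.get(token, floor)))
        # Rank categories by accumulated log-likelihood.
        ranked = sorted(models, key=lambda lab: sum(logps[lab]), reverse=True)
        best, runner_up = ranked[0], ranked[1]
        if used < min_tokens:
            continue
        # Per-token advantage of the leader over the runner-up.
        diffs = [a - b for a, b in zip(logps[best], logps[runner_up])]
        mean = sum(diffs) / used
        var = sum((d - mean) ** 2 for d in diffs) / (used - 1)
        lower = mean - z * math.sqrt(var / used)
        if lower > 0:
            return best, used, True  # enough evidence: terminate early
    return best, used, False  # input exhausted without a clear decision


if __name__ == "__main__":
    print(classify_with_confidence(["the", "of", "and", "the", "of"], TOY_MODELS))
```

When the third element of the result is `False`, the input ran out before the limit cleared zero. A caller could use that to report low confidence instead of forcing a choice between similar categories, which is one way a classifier can give feedback about its own reliability.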




Journal:
  • CoRR

Volume: cs.CL/9907010

Publication date: 1998